FIGURE 1: Mmel DNA CLEAVAGE 
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FIGURE 2: Mmel DNA SEQUENCE 

1 GAATTCCAGA TAGGTAGTCC TTTGGTACTT CCATCCCAAC CAGTGTCACG 

51 TTCCGCGCCA AACCAATCGG TTAAAGTGTA AGAAAGTCTT GCACTGAAGT 

101 AGCTGTAGGA CAAACCGAAG TTAACCTCTG TGGTATCCCA GCGACCACCT 

151 TTAGGTGTTT GACGGAAGCC TGCTGCGTCA CCTGCCAAGT TATATTTCTT 

201 CCATGAACCA CCTGGGTACA GGTAGCTGAT CAAACCAGCA GTCCAACCCA 

251 AGCCTTCAAT AGCAGGAATA GTTCCGTTAT ACCCACCATA AATATCAATT 

301 TCGGCAGTTG CATCAGGGAA GGTATTTGGT GTCACGTTTG AACCCCATGC 

351 ACCGACATAA AAGCCGCTGT CATGAGTAAT ATCAATACCG CCTTGAACGG 

401 CAGGTTTGTG CCAGTTTTGT GAAATACCAC GAGCATAGTA ATCTGAAACA 

451 AATCCAACGT TTGCAGTAGC AGCCCAGGCT GATTTTTCTT CTTTAGCCTC 

501 TTCAGCTGCG TATGAAACTT GGGCAAAAGA TAATGTGCTT AACACTGCTG 

551 TGAGCAATAT AGATTGACGC ATTATGAGTC CTCTCTCTGT GAAATCTTTG 

601 ATTAAGTTGT TGTAAACGAG AATGAAACAA CAACCACAAA GCAAAGCACG 

651 TGCCAAACTA TAAATAACAT TATAATCAAT TATTTAAAAT ATATTTATAA 

7 01 TC T AAAA.TAT TAAATTAATT ATTTAATAAA CTGTTTTTTA TTGATT'J. AAC 

7 51 TCTAAAACAT ATGGGTGCAA CCACCCTTTT TACTCACTGA TAATGC TAAN 

801 ATAGCCAACA AAGGAGCCTT CACCATGCTG ATTTCAAATG AAAAAATTCA 

851 GGAATTATCT TTAAAAATCA AACAACTAAT CGAATCAAGC CCCATTTCAG 

901 AGCTAAATAA CAACTTGCAT GCACTAATTC AGGGCGCACT CACCAAAATG 

951 GAACTTGTTT CGCGTGAAGA ATTCGATATC CAATCTGCAT TATTAGCGCG 

1001 CACGCAAGAG CAATTAAAAC GTCTTGAAGA AAAAATCAGC CAGCTTGAAG 

1051 AAGGGCAGGC ATCCAGAAAG TAAAAATTAA TTTACAATTG TTAGCATTCC 

ilOJ ATTATTGAGG AGTGCG'-TAT GAGTCTGGCG GlGT ! >/f AC^ GTCGCuCGT'i 



1151 AAGCGGCATG GAGGCGCCAG AAGTGGTGGT AGAAGTCCAC TTGGCGAATG 

1201 GACTACCCAG CTTTACCATT GTTGAAACAT ATTGAAACTT TAAGCCTTAG 

1251 CATTTTTTCA AATATACAAA TGCCCCAAGC TGGTGCATTA AGAAGAATGT 

1301 AACAACTCCC TGCAGACTAG GAATAACTTC ATGATTTAAC GAACATCCCT 

1351 GAGTTTCAAA GTCGAATCTT CTCGTGTTGC AAATTTCTAC AGCTTCCTTT 

1401 CTGACCCTCT TGCACCAAAT TGCACTATGG CGCTAATAAA TCTTCTGCTA 

1451 TCCAATAATG TCCAACTAAC CCTTTATGGA CTCTTAAAAA AGATTTAATA 

1501 AATGATTAAG ATGAATTCAA GGAATTTGAT GCCTGGAAAT ATGGCAAAAG 

1551 CAAAAAGGCA GCCCAGTGCT GACTTTTTTG TTTTAACATT GGCCCATATA 

1601 TCCAATTTCA AATAATTTAA AAATTATCGG GAGCTAATCT GTGGC TTTAA 

1651 GCTGGAACGA GATAAGAAGA AAAGCTATTG AGTTTTCTAA AAGATGGGAA 

1701 GACGCCTCAG ATGAAAACAG TCAAGCCAAA CCCTTTTTAA TAGATTTTTT 

1751 CGAAGTTTTT GGAATAACTA ATAAGAGAGT TGCAACATTT GAGCATGCTG 

1801 TGAAAAAGTT CGCCAAGGCC CATAAGGAAC AATCTCGAGG ATTCGTAGAT 

1851 TTGTTTTGGC CTGGCATTCT TCTTATTGAA ATGAAAAGCA GAGGTAAAGA 

1901 CCTCGACAAA GCGTATGACC AGGCACTTGA TTACTTTTCT GGCATTGCAG 

1951 AAAGAGACTT ACCCAGATAC GTTTTAGTTT GCGACTTCCA GCGTTTCAGA 

2001 TTAACAGACC TAATAACAAA AGAGTCAGTT GAATTTC TTT TAAAGGACTT 

2051 ATACCAAAAT GTGAGGTCTT TTGGTTTTAT AGCTGGTTAT CAAACTCAAG 

2101 TAATCAAGCC ACAAGACCCT ATTAATATTA AGGCGGCTGA ACGGATGGGT 

2151 AAGCTTCATG ACACCCTGAA GTTGGTTGGA TATGAGGGAC ACGCTTTAGA 

22 01 AC TTT ATC T A GTGCGTTTAC TTTTTTGCTT ATTCGCAGAA GACACAACTA 

22 51 TTTTTGAGAA AAGTTTATTC CAAGAATATA TCGAGACAAA GACGCTAGAG 

2 3 01 GACGGCAGTG ACCTTGCACA TCATATCAAT ACACTTTTTT ATGTTCTCAA 

ij>Si :ACco.,h3:^ c/i/aaaagat taaagaat'Tt agacgaacac ctt^ctc^t 



2401 TTCCATATAT CAATGGAAAA CTTTTCGAGG AGCCACTTCC GCCAGCTCAG 
2451 TTTGATAAAG CAATGAGAGA GGCATTGCTT GACTTGTGCT CATTAGATTG 
2501 GAGCAGGATT TCACCAGCAA TATTTGGAAG TTTATTCCAA AGCATTATGG 
2551 ATGCTAAAAA GAGAAGAAAT CTTGGGGCAC ACTACACCAG CGAAGCAAAT 
2601 ATTCTCAAGT TAATCAAGCC ATTGTTTCTT GACGAGCTCT GGGTAGAGTT 
2651 CGAGAAAGTT AAAAATAATA AAAATAAATT ACTAGCGTTC CACAAAAAAC 
2701 TAAGAGGACT TACATTTTTC GACCCTGCAT GCGGTTGCGG AAATTTTCTT 
2751 GTAATCACAT ACCGAGAACT AAG AC TTTT A GAAATTGAAG TGTTAAGAGG 
2801 ATTGCATAGA GGTGGTCAAC AAGTTTTGGA TATTGAGCAT CTTATTCAGA 
2851 TTAACGTAGA CCAGTTTTTT GGTATCGAAA TAGAGGAGTT TCCCGCACAG 
2901 ATTGCTCAGG TTGCTCTCTG GCTTACAGAC CACCAAATGA ATATGAAAAT 
2951 TTCAGATGAG TTTGGAAACT ACTTTGCCCG TATCCCACTA AAATCTACTC 
3001 CTCACATTTT GAATGCTAAT GCTTTACAGA TTGATTGGAA CGATGTTTTA 
3051 GAGGCTAAAA AATGTTGCTT CATATTAGGA AATCCTCCAT TTG TTGGTAA 
3101 AAGTAAACAA ACACCGGGAC AAAAAGCGGA TTTACTATCT GTTTTTGGAA 
3151 ATC TTAAATC CGCTTCAGAC TTAGACCTAG TTGCTGCTTG GTATCCCAAA 
3201 GCAGCACATT ACATTCAAAC AAATGCAAAC ATACGCTGTG CATTTGTCTC 

32 51 AACGAATAGT ATTACTCAAG GTGAGCAAGT ATCGTTGCTT TGGCCGCTTC 
3301 TGCTCTCATT AGGCATAAAA ATAAACTTTG CTCACAGAAC TTTCAGCTGG 

33 51 ACAAATGAGG CGTCAGGAGT AGCGGCGGTT CACTGCGTAA TTATCGGATT 

34 01 TGGGTTGAAG GATTCAGATG AAAAAATAAT CTATGAGTAT GAAAGTATTA 
3 4 51 ATGGAGAACC ATTAGCTATT AAGGCAAAAA ATATTAATCC ATATTTGAGA 
3501 GACGGGGTGG ATGTGATTGC CTGCAAGCGT CAGCAGCCAA TCTCAAAATT 
3 551 ACCAAGCATG CGTTATGGCA ACAAACCAAC AGATGATGGA AATTTCCTAT 
3 601 TiACToACCA AG AAAAAAAC C AATTT A T'T A OAA A7GAGCC ^TC^T^OAA 



3651 AAATACTTCA GACGGTTTGT GGGCGGGGAT GAGTTCATAA ACAATACAAG 

3701 TCGATGGTGT TTATGGCTTG ACGGTGCTGA CATTTCAGAA ATACGAGCGA 

3751 TGCCTTTGGT CTTGGCTAGG ATAAAAAAAG TCCAAGAATT CAGATTAAAA 

3801 AGCTCGGCCA AACCAACTCG ACAAAGTGCT TCGACACCAA TGAAGTTCTT 

3851 TTATATATCT CAGCCGGATA CGGACTATCT GTTGATACCT GAAACATCAT 

3901 CTGAAAACAG ACAATTTATT CCAATTGGTT TTGTTGATAG AAATGTCATT 

3951 TCAAGTAACG CAACGTATCA TATTCCTAGT GCTGAACCTT TGATATTTGG 

4001 CCTGCTTTCA TCGACCATGC ACAACTGCTG GATGAGAAAT GTAGGAGGAA 

4051 GGTTAGAAAG TCGTTATAGA TATTCTGCCA GCCTGGTTTA CAACACGTTT 

4101 CCATGGATTC AACCCAACGA AAAACAATCG AAAGCGATAG AAGAAGCTGC 

4151 ATTTGCGATT TTAAAAGCTA GAAGCAATTA TCCAAACGAA AGTTTAGCTG 

4201 GTTTATACGA CCCAAAAACA ATGCCTAGTG AGCTTCTTAA AGCACATCAA 

4251 AAACTTGATA AGGCTGTGGA TTCTGTCTAT GGATTTAAAG GACCAAACAC 

43 01 AG AAATTGC T CGAATAGCTT TTTTGTTTGA AACATACCAA AAGATGACTT 
4351 CACTCTTACC ACCAGAAAAA GAAATTAAGA AATCTAAGGG CAAAAATTAA 
4401 TTAATGTATT TAACATTAAA CCACCCTGAT TTATTTCGAA TAGTTCAAAT 

44 51 GCTTCCATGT GGACTAATCG CCTTCAATCA TATTAAAAAA CCGACGCTAG 
4 501 TAATAAAAAC TTCCAAAGAG GCCATATTAA CCGCCAAAAT TAATCGTGAA 
4551 TTTAAAATAT ATCTTTATCA AACCACATCG GCTTGTGTTC TAGTAAGTGC 
4 601 ATTTTTTGAC GATTCTGATA GTCCACTATT CATTACAACA CCAATTGTTC 
4651 GAGATGACCA ACACTCCTTA GACTTGTTAA GATTTTTAAT CAACAATGAT 
47 01 TTTACGATTT GCTTCTTTGA TGAACTGAAC CGAGAATTTC TTTCCGTTAA 
47 51 CGCAACTGGT AATTTAGTCT CTATCTTTGA GAGCATTCAC TTGATGCCAC 
4 801 TGCCGAGCCC AGAGGAAGCC CACAATGCAT TGAATGAAGC GGAATTTTGG 
-13 r V; 'WAGTTTAC GCTCAGCTCC TGATG Vl'GAA ™C ATC T h TC r ZVOGT^CTTT 



4901 ATTGGATAAT CTATTTCCTG ACGATTTTGT AATTTATGAC CTATCCTCAA 
4951 ACAAAAACGA TATGACATCA TTGGTTAGAG AAACTAAACC AGGATACTAT 
5001 CAGGAAGCAG ATATTGCAAA GTTACTAACA AGAGCTTTTA GTTTGGAAAG 
5051 CATTTATCAG AATCCAGTGA AAACAAGCGA TTCAAAAGAG TTGGCAGACG 
5101 TTGTGGTATT CGGCCAAAAG GAAATTTTAA TAATTCAAGC TAAAGATAGT 
5151 GAAAACAATC AGAAACAAGT TTTAGAGGTT TCGTTAGACA AGAAATGCGC 
5201 AAAGTCTTCA AAGAAACTTT CTGAAGCTTT GGCACAACTC ACCGACACTA 
5251 TCTTAACAAT ATCCAATACA CCAATAGTTG ATGTTCGGGT TGGTAAGAAA 
5301 AAATGCACTC TGAACTTTGA GGGAAAGCAG CTTATTGGTA TCGTCGTTGT 

53 51 TAAAGAGCTT TTTAATGATA TTTACGATAA ATACAGTCAA AAAGTTTTTG 
5401 AGCATGTAGA GTTGTCTAAA GCACCCATTG TCTTCTTTGA CTATCCAGAA 

54 51 TTTGCAAGAA TGACATTTCA TTGTAATTCT GAGGAATTAT TACTTTATGC 
5501 TTTGCATAGG ATATTTAGTT CTGCAATAGA AAATGGAATG TATAAACGAT 
5551 TGAGATTTAC TCAACCTATC ATAACTGATG GTCATGACAG CTACTTCAGG 
5601 ATACAAAACA GGCCCCATTC TGATGAGGCC TATTTAATTT GCACAGAGGA 
5651 TGAAATGAAG CTCTCAAATA AGTTTAAAGA CTAAATTTAT ATTTTC C TC A 

57 01 GTATCTTAAA AACAATATTC ATTA^.ATTGG AAAGCCCGCA ATGATICTTG 
5751 CAGTATCAAT GCGGGCATCA GTATCCAGCT CTTGCAATAC ACGGAAGTAT 
5801 CAAGAAGCGA ATCAGGATTC TAACCATACC TTTTTAATTG CAACAATCTA 

58 51 ATTTCC ATAA CATGTGTAGC TACATCGAAA AAAAGACCTC GAAGAGGTTG 
5901 CAAGAGCGTC CAGCTCGCGG CATCAAAAGA CCCTAGTCTT TTGACAAGGG 
5951 GGAGCCAAAA AACTGAGGTG GAGGAGCTTG CCGACGAAGC CAGGAAGCCC 
6001 CAGCGTCCGG 



FIGURE 3: Mmel AMINO ACID SEQUENCE 

1 MALSWNEIRR KAIEFSKRWE DASDENSQAK PFLIDFFEVF GITNKRVATF 

51 EHAVKKFAKA HKEQSRGFVD LFWPGILLIE MKSRGKDLDK AYDQALDYFS 

101 GIAERDLPRY VLVCDFQRFR LTDLITKESV EFLLKDLYQN VRSFGFIAGY 

151 QTQVIKPQDP INIKAAERMG KLHDTLKLVG YEGHALELYL VRLLFCLFAE 

201 DTTIFEKSLF QEYIETKTLE DGSDLAHHIN TLFYVLNTPE QKRLKNLDEH 

251 LAAFPYINGK LFEEPLPPAQ FDKAMREALL DLCSLDWSRI SPAIFGSLFQ 

301 SIMDAKKRRN LGAHYTSEAN ILKLIKPLFL DELWVEFEKV KNNKNKLLAF 

351 HKKLRGLTFF DPACGCGNFL VITYRELRLL EIEVLRGLHR GGQQVLDIEH 

401 LIQINVDQFF GIEIEEFPAQ IAQVALWLTD HQMNMKISDE FGNYFARIPL 

451 KSTPHILNAN ALQIDWNDVL EAKKCCFILG NPPFVGKSKQ TPGQKADLLS 

501 VFGNLKSASD LDLVAAWYPK AAHYIQTNAN IRCAFVSTNS ITQGEQVSLL 

551 WPLLLSLGIK INFAHRTFSW TNEASGVAAV HCVIIGFGLK D3DEKIIYEY 

601 ESINGEPLAI KAKNINPYLR DGVDVIACKR QQPISKLPSM RYGNKPTDDG 

651 NFLFTDEEKN QFITNEPSSE KYFRRFVGGD EFINNTSRWC LWLDGADISE 

7 01 IRAMPLVLAR IKKVQEFRLK SSAKPTRQSA STPMKFFYIS QPDTDYLLIP 

7 51 ETSSENRQFI PIGFVDRNVI SSNATYHIPS AEPLIFGLLS STMHNCWMRN 

801 VGGRLESRYR YSASLVYNTF PWIQPNEKQS KAIEEAAFAI LKARSNYPNE 

851 SLAGLYDPKT MPSELLKAHQ KLDKAVDSVY GFKGPNTEIA RIAFLFETYQ 

901 KMTSLLPPEK EIKKSKGKN* 



FIGURE 4: pTBMmel . 1 IS RESISTANT TO Mmel CLEAVAGE 



1 2 3 4 5 6 7 



. .. M W I ■> 












Ml 








WW* 


■ • " '1 











FIGURE 5: Mmel CLEAVAGE OF HEM I -METHYLATED SUBSTRATES 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 



FIGURE 6: METHYLAT ION INCORPORATION BY Mmel ENDONUC LEASE 



Top Strand: / 

5'-TCCGAC-3' / 

unmethylated: / 

unmethylated: / 

methylated: / 

methylated: / 



Bottom strand: 
5 ' -GTCGGA-3 ' 

unmethylated 

methylated: 

unmethylated: 

methylated: 



3 H-COUNTS 

19,972 
14,447 
1,266 
917 



'A' indicates position of N6-methyl adenine in the DNA 
substrate 



Mmel Figure 7 : Multiple Sequence Alignment of Mmel and Homologous 
Polypeptides 

PileUp of: @mme.list2 

Symbol comparison table: GenRunData :blosum62 .cmp CompCheck: 1102 

GapWeight : 6 
GapLengthWeight : 1 

Name: mmelfeLORF3P = gi | 28373198 | ref | NP_783835 . 1 | (SEQ ID NO:3) 

Name: mmeLrel21P = gi | 23451826 | gb |AAN32874 . 1 |AF461726_1 (SEQ ID NO:4) 

Name: mme = Mmel amino acid sequence {SEQ ID NO: 5) 

Name: mmeNMA1791 = gi | 15794682 | ref | NP_284504 . 1 | (SEQ ID NO:6) 

Name: mmeBSU0677 = gi | 16077744 | ref | NP_388558 . 1 | (SEQ ID N0:7) 

Name: mmegcry - gi | 9945797 | gb j AAG03371 . 1 | (SEQ ID NO:S) 

Name: mmePflQ8 = gi | 23451826 | gb | AAN32874 . 1 | AF461726_1 (SEQ ID NO:9) 

Name: saro3834 = gi | 23110638 j gb | ZP_00096791 . 1 | (SEQ ID NO: 10) 

Name: mmeMSI135 = gi | 20803963 | emb | CAD31540 . 1 | (SEQ ID NO:ll) 

Name: mmeCC0826 = gi | 16125079 | ref |NP_419643 . 1 | (SEQ ID NO:12) 

Name: mmeDR0119.1 = gi | 15807788 | ref | NP_285443 . 1 | (SEQ ID NO:13) 

Name: mmeDR2267 = gi | 15807258 | ref | NP_295988 . 1 | (SEQ ID NO:14) 



mmelfeLORF3P 
mmeLrel21P 
mme 

mmeNMAl791 
mmeBSU0677 
mmegcry 
mmePf iQ8 
saro3834 
mmeMSI135 
mmeCC0826 
mmeDR0119 .1 
mmeDR2267 



50 

-MPT RQQAAREFVK TWS . SDKKGR EDADRQTFWN 



MALSWNE IRRKAIEFSK RWE.DASD.. ENSQAKPFLI 

MKTLLQ LQTAAQNFAA YYK.DQTD.. ERREKDTF*N 

MALID LEDKIAEIVN R.E.DHSD FLY 

MVMAPTTVFD RATIRHNLTE FKLRWLDRIK QWEAENRPAT ESSHDQQFWG 



-MSLGAAGL TPITPAAFIK KWRKSELG. . ERQAAQEHFL 

— MTPAQFVK KWSDSQLR. . ERQASQEHFL 

, — MHPQEFAD TWS RRALKAT ERDSYVQHWL 

MPQTE TAQRMEDFVA YW . . RTLKGD EKGESQV . FL 



mmelfeLORF3P 
mmeLrel21P 
mme 

mmeNMA17 91 
mmeBSU0677 
mmegcry 
mmePf 1Q8 
saro3834 
mmeMSI135 
mmeCC0826 
mmeDR0119 . 1 
mmeDR22 67 



51 100 
DLLQRVYGID N.YYDYITYE KDVQVKADGK VTTRRIDGYI P . STKIMVEM 

DFFE . VFGIT N.KR. . .VAT FEHAVKKFAK AHKEQSRGFV DL . . . FWPGI 
EFFA.IFGID R.KN. . . VAH FEYPVKD . . P ADNTQ . . . FV DI . . . FWEGI 
ELLG.VYDVP R.AT. . . ITR . . . .LKK.GN QNLTKRVGEV HLKNKVW. . . 
DLLDC.FGV. N.ARDLYLY QRSAK RASTGRTGKI DM . . . FMPGK 



M 

D.ICSLVGHP SP.SDEDPTG AFFAFEKGAN KLG . GGKGFA D.VWK-.KGH 
D . LCRMLEVP TP . AEDDPLG ERYCFERGAA KTG . GGDGWA D . VWR . . KGC 

D.LCQLLHHE APGADPD YKFERRVT KVGTKDKGFA D . VFK . . KAH 

DRLFQAFGH. . . . AGYKEAG AE . . LEYRVA KQG . GGKKFA DLLWR . . PRV 



mmelfeLORF3P 
mmeLrel21P 
mme 

mmeNMAl7 91 
m.«.CsBS005 77 



101 

KGKNIKDLSK 



PITQSGGD ELT PFEQAKRYAN FLPN . 



150 
.SEQ 



LLIEMKSRGK DL D KAY D . . QALDYFS GIAERD...L 

FLAEHKSANK NL T KAK E . . QAERYLQ EIGRTKPSAL 

. b'KEAK . KCK LF D hu\ CIEQQVEYL CAK 



mmegcry VIGEAKSLGV PLDDA YAQALDYLL G.GTIANSHM 



mmePf 1Q8 
saro3834 
mmeMSI135 
mmeCC082 6 
mmeDR0119.1 
itimeDR2267 



NPVEIEEAVS DLARAPYDAS EFPFQFLAAF GNKQTTLQRL RAGNSNQSDL 

FAWEYKRKKG NLDEA LLQLMRYAP AL 

FGWEYKGKHK NLDAA ( LRQLQAYAL DL 

FITEYKRPGS DLGAA ' LQQATLYSR DL 

LI . EMKKRGE KLANH YQQAFDYWL KL 



mmelfeL0RF3P 
mmeLrel21P 
mme 

iraneNMA1791 
mmeBSU0677 
mmegcry 
mmePf 1Q8 
saro3834 
mmeMSI135 
mmeCC0826 
mmeDR0119 . 1 
mmeDR2267 



mmelfeL0RF3P 
mmeLrel21P 
mme 

mmeNMA1791 
mmeBSU0677 
mmegcry 
mmePf 1Q8 
saro3834 
mmeMSI135 
mmeCC0826 
mmeDR0119.1 
mmeDR22 67 



151 

PR WILVSNFNEI DIHDM. .E. 



200 



. RPLDEPKVI KL. 



PR YVLVCDFQRF RLTDLITK. . .ESVE....F LL. 

PE YYAVSDFAHF HLYRRVPE . . . EGAENQWQF PL . 

PR YLLVTDYDGV LAKDTKTL. . . EALDVKF 

PA YWCSNFETL RVTRLNRTYV GDSADWDITF PL. 



PGAVLQRNHI HIATCDAGNV DRTLAALRKS PKTASQKARF ILATDGVAFQ 

L..SPP LHIVCDIERL RIHTAWTNTV PSTY. .VITL DDLAE 

Q..NPP YLWSDMERI IVHTNWTNTI SRKI..EFTL DDLHE 

G..NPP LLLTSDFQRI EINTAFTGTS PKSY..LITL DDIAENRWG 

VPDRPR YAVLCNFDEL WVYDF NQQ L DEPMDRLRI . 

201 250 

. EDLPKKVKS L E F MVDA NQQQVIDEKQ LSVDAGNLVA 

. KDLYQNV . R S FGFIA...GY QTQVIKPQDP INIKAAERMG 

. EELPEYITR G V FDFMF...GI EAKVRQIQEE ANIQAAATIG 

. EELPQY FDFFLAWKGI EKVEFEKENP ADIKAAERFA 

. AEIDEHIEQ L A F LADY ETSAYREEEK ASLEASRLMV 

AEDMASGETV ACNYAAFPDK FAFFLPLAGI TTVQQIRESS FDIKATGRLN 

. . . . PSAREM LHNVFFSPEK L RPTR. . TRAA VTKEAaDKFS 

PEKLAM LRQVFDGSDS L KPKI..SPQE LTAKVAQRFG 

GNDVP.ALQI LHSALHQPYD L DPRL . . FRER ITTDATRQVG 

. EELPERYTV LNFMFEQ . . E R APLFGNNRVD VTREAADSVA 



mmelfeLORF3P 
mmeLrel21P 
mme 

mmeNMA1791 
mmeBSU0677 
mmegcry 
mmePf 1Q8 
saro3834 
mmeMSI135 
mmeCC0826 
mmeDR0119 . 1 
mmeDR2 267 



251 

KIYNELTNAY AAGRGIDVN. 



300 

. .EPRIQRS LN..MLIVRL VFLLYADDSN 



KLHDTL . K LVG YE . GHA LE . . LYLVRL LFCLFAEDTT 

RLHDAL.K EEG. ...IYE.EHE LR..LFITRL LFLFFADDSA 

RIYDVLRK ENN. ...IIETNRG LD..LFLIRL LFCFFAEDTD 

ELFRAMNGDD VDEAVGDDAP TTPEEEDERV MRTSIYLTRI LFLLFGDDAG 

KLYVELLKDN PDWA SRS EDMNHFMARL IFCFFAEDTD 

AI ALRVQGR . G.TPD EIAHFVNQL VFCFFAQSVS 

DLGRRLQER . GHHPR DVAHFLNRV VFCMFAEDAK 

LVARRLGERE GRT RAAHMMMRV VFALFAEDTG 

KVLNSVIAR . GEDRA RAQRFLLQC VMAMFAEDFE 



mmelfeLORF3P 
mmeLrel21P 
mme 

mmeNMAl7 91 
mmeBSU0677 



301 

LFGKEDIFQA FIER. 



350 

.REP RDIRRDLSEL FKVLDQP . EE QRDPYLDDEF 



IFEKS.LFQE YIETKTLEDG SDLAHHINTL FYVLNTP . EQ KRLKNLDEHL 

VFRRNYLFQD FLE . . NCKEA DTLGDKLNQL FEFLNTP . DQ KRSKTQSEKF 

IFKRNS.FTN LIKTLTEEDG SNLNKLFADL FIVL....DK NERDDVPSYL 

mmegcry LWDTPHLFAD FVRNETTPE. . SLGPQLNEL FSVLNTA . PE KRPKRLPSTL 



saro3834 
mmeMSI135 
mmeCC0826 
mmeDR0119 .1 
mmeDR2267 



mmelfeL0RF3P 
mmeLrel21P 
mine 

mmeNMA1791 
mmeBSU0677 
mmegcry 
mmePf 1Q8 
saro3834 
mmeMSI135 
nimeCC0826 
mmeDR0119.1 
mmeDR22 67 

mraelfeL0RF3P 
itimeLrel2lP 
mme 

mmeNiy[A1791 
mmeBSU0677 
mmegcry 
mmePf 1Q8 
saro3834 
mmeMSI135 
mmeCC0826 
mmeDR0119 . 1 
mmeDR2267 



mmelfeL0RF3P 
mmeLrel21P 
mme 



IFVGEGLFSR TVETMSARDA SDTHMVIAEI FRAMDTRLAD RAAAGIKSWA 

LLPD.GLFTK LLK.RSARAP ERAMSYLDKL FEAME RGGEF . . . DL 

LLPE . GLFTR LTRSMQMRPP AEAAPQFDAL FAMMR AGGMF . . . GA 

MLER.GIVTR LLE.RARAPP GEDQLYFQDL FGAMK GGGEF...WG 

LI PR . GFFTE LADD . ARAGR GSSFDLFGGL FRQMNTSERA RGGRF 

351 400 
NQFAYVNGGM FSDENVIIPQ FTDELKRLIV EDAGRGFDWS GISPTIFGAV 

AAFPYINGKL FEEPLPPA.Q FDKAMREALL .DLCS.LDWS RISPAIFGSL 
KGFEYVNGGL FKERLRTF . D FTAKQHRALI .DCGN.FDWR NISPEIFGTL 
KEFPYVNGQL FTEPHTEL . E FSAKSRKLII . ECGELLNWA KINPDIFGSM 
AKFPYVNGAL FAEPLAS . EY FDYQMREALL AAC..DFDWS TIDVSVFGSL 



DVFPYVNGQL 
TDITWFNGGL 
DIVHWFNGGL 
TDIRHFNGGL 
APIPYFNGGL 
401 

FESTLN . PET 



FSGS.TECPR 
FDGR . . RALR 
FDEK . . PALP 
FDSE . . DALA 
FRAV. .DPIE 



FSKIARSYLL 
LDDGDIGLL . 
LERADIKLIH 
LTSEDAAAL. 
LNRDELYLLH 



H. . IGSLDWQ 
. VAADGLDWG 

DTAAEH . DWS 
. I I AAKLDWS 

KAALEN.NWA 



RRSGGMHYTS IENIHKVIDP LFLNDLHDEF 



KINPDIFGSM 
LIDPTIFGTL 
DLDPSVFGNM 
EVEPSIFGTL 
RIQPQIFGVL 
450 

D 



FQSIMD . AKK RRNLGAHYTS EANILKLIKP LFLDELWVEF E. 

FQSVMD.AQE RREAGAHYTE AANIDKVING LFLENLRAEF E. 

IQAVAS . EES RSYLGMHYTS VPNIMKVIKP LFLDKLNQSF . . 
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